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Abstract — This paper re-examines the well-known fundamen- 
tal tradeoffs between rate and reliability for the multi-antenna, 
block Rayleigh fading channel in the high signal to noise ratio 
(SNR) regime when (i) the transmitter has access to (noiseless) 
one bit per coherence-interval of causal channel state information 
(CSI) and (ii) soft decoding delays together with worst-case 
delay guarantees are acceptable. A key finding of this work 
is that substantial improvements in reliability can be realized 
with a very short expected delay and a slightly longer (but 
bounded) worst-case decoding delay guarantee in communication 
systems where the transmitter has access to even one bit per 
coherence interval of causal CSI. While similar in spirit to 
the recent work on communication systems based on automatic 
repeat requests (ARQ) where decoding failure is known at the 
transmitter and leads to re-transmission, here transmit side- 
information is purely based on CSI. The findings reported here 
also lend further support to an emerging understanding that 
decoding delay (related to throughput) and codeword blocklength 
(related to coding complexity and delays) are distinctly different 
design parameters which can be tuned to control reliability. 

I. Introduction 

It is well known that multiple antennas can substantially 
increase the capacity of a point-to-point fading channel and 
can also significantly improve the reliability of communica- 
tions via space diversity. Interestingly, both these gains can 
be obtained without having any CSI at the transmitter [1], 
[2]. In fact, even with full non-causal transmit CSI, the 
ergodic capacity of multi-antenna links cannot be improved 
substantially at high SNR (specifically, the scaling law of 
the capacity with SNR remains the same). This motivates 
the question of whether any knowledge of the channel at 
the transmitter could improve the reliability of multi-antenna 
channels. 

There is a large body of work in characterizing the reliability 
gains of multiple-input multiple-output (MIMO) block-fading 
channels [2]-[4]. When the transmitter has no CSI, Zheng 
and Tse [1] characterized the tradeoffs between the rate and 
the reliability exponent at high SNRs by analyzing what is 
referred to as the outage event and proving that the overall 
error probability is dominated by the probability of the outage 
event. Specifically, outage refers to the event that the channel 
realization during the code length interval is too poor to 
support the given rate [5]. It turns out that for a channel 
with coherence interval L, the reliability exponent of a block 
code of blocklength / depends only on rate and the number of 
antennas and does not depend on the code length / (when I is 



larger than the total number of transmit and receive antennas). 
In this setting, the decoding delay is, of course, equal to the 
code blocklength 

Any side information which is correlated to the channel 
conditions when made available to the transmitter, including 
causal CSI [6] and/or decoder statu^ [7]-[ll], can potentially 
improve the rate and reUabiUty. In particular, it is well known 
that feedback can substantially improve the error exponent of 
additive white Gaussian noise (AWGN) channels via variable 
length coding and power control assuming no strict delays and 
peak power constraints [7]-[ll]. For block-fading channels, 
in [3] an automatic retransmission request (ARQ) is shown to 
substantially improve the diversity gain by allowing codeword 
retransmissions (random variable-length channel codes) with 
the aid of noiseless one-bit decision feedback and power 
control. Therefore, in a communication system with feedback, 
the decoding time T (^channel uses) is a random variable 
depending on I and the state of the decoder In order to capture 
this random decoding delay, Burnashev introduced the notion 
of error exponents for a feedback code which transmits one 
of M messages in expected time at most I [8]. 

In this paper, our focus is on the former type of transmitter 
side information, namely causal CSI. In particular, we assume 
that the transmitter has access to only one noiseless bit 
of causal side information per coherence interval from the 
receiver which describes the channel state as being "good" 
(above a predecided threshold) or bad (below the threshold). 
This type of channel state feedback has substantially less 
coding complexity than that of the ARQ systems as it only 
requires the knowledge of the channel state and not the status 
of the decoder Following [8], we define the reliability function 
as the smallest error probability that can be achieved by a 
code of length I which transmits one of p''^ messages in the 
expected time at most [l + e) where r is the multiplexing 
gain and e tends to zero as the SNR p goes to infinity. By 
leveraging this side information and the notion of an expected 
decoding delay, we show that the reliability function can be 
substantially improved in the high SNR regime. In our scheme, 

'Throughout the paper, we assume the code blocklength I is less than or 
equal to the coherence interval L of the channel. Results for i > L are 
straightforward generalizations but are omitted. 

-Technically, one distinguishes between causal CSI which is independent of 
the message and is available at the transmitter before the next transmission and 
decoder feedback which depends on the channel outputs after a transmission. 



transmission occurs only when the channel satisfies a certain 
condition which can be ascertained at the receiver. Therefore, 
similar to feedback channels, the decoding delay is random 
and rate may be reduced as we do not use the channel all the 
time [3], [9], [11]. However, it turns out that in the regime of 
high SNR, the channel most likely satisfies the conditions and 
thus the scaling law of the rate remains unchanged. 

When there is no delay constraint and noiseless one-bit 
causal CSI feedback is available, we show that the outage 
events can be completely avoided and an error exponent which 
is as good as that of a pure multi-antenna AWGN channel 
without fading can be attained. In this case, in sharp contrast to 
the case where the transmitter has no CSI, the error probability 
approaches zero exponentially with the blocklength for all 
multiplexing gains. We further consider the case where the 
decoding delay has a strict deterministic deadline, that is, a 
finite maximum delay constraint, in which the receiver has 
to decode the message in at most D coherence intervals. In 
this case, it is clear that outage events cannot be avoided and 
the effects of fading will reappear However, we show that a 
substantial gain on the reliability function can be achieved 
even with a specified finite maximum decoding delay. For 
instance, in a channel with M transmit antennas, and one 
receive antenna, %90 of the diversity gain of an AWGN 
channel can be attained with a worst case delay of 13 = 
\ogM (10//M). This shows when / = 10 and M — 3, we only 
need a delay of Z3 = 4 to completely overcome the effects 
of fading and approach the performance of MIMO AWGN 
channels. 

This paper is organized as follows: In Section |ll] we 
present the channel model and introduce the problem set-up. 
In Section Uni we obtain the reliability exponent when there is 
no bound on the worst case delay. In Section |IV] we analyze 
the case where the maximum decoding delay is bounded and 
treat the scenarios with and without a finite peak to average 
power constraint. 

II. Background 

A. Channel Model 

We consider a frequency-flat block-fading MIMO channel 
with M transmit and receive antennas. The channel is 
assumed to remain fixed for L channel uses (the channel 
coherence interval) and change independently to another state 
in the next block. Therefore, within each coherence interval, 
the received signal at time t can be expressed as, 

yt^ ^HSt + Wt, t=l,...,L, (1) 

where p is the average transmit power, H denotes the (fixed) 
N X M random fading channel matrix, and for each discrete 
time t, St denotes the M x 1 channel input symbol and 
Wt the X 1 additive white Gaussian noise vector. All the 
entries of H and Wt are assumed to have a zero-mean, unit- 
variance, complex circular Gaussian distribution. Furthermore, 
the transmit message St at each time t is required to satisfy 
the average power constraint E{SlSt) < 1. In all random 



coding bounds for the reliability function, we assume St to 
be a random Gaussian codeword with blocklength I < L. The 
I > L case can also be analyzed but is omitted for brevity. 

B. Reliability, Rate: No CSI at the Transmitter 

Rate versus reliability tradeoffs have been well studied for 
point-to-point AWGN channels through the notions of the 
error exponent (large blocklength) and the diversity gain (large 
SNR). For fading MIMO channels with no transmit CSI, 
Zheng and Tse studied the smallest error probability that can 
be achieved in the asymptotic of high SNR by a code of size 
/?'''. In this regime, the error probability is characterized by 
the diversity gain defined as, 

where R = is the code rate in bits per channel use, 

I is the code blocklength, and is the smallest achievable 
error probability for a rate-i?, blocklength / code. It is further 
shown that when the transmitter has no CSI, d{r) is given 
by a piecewise linear function connecting the points (r, [M — 
r){N — r)) for r = 0, 1, . . . , min(Af, N). More explicitly, 

d{r)=ak-rl3u, fc - 1 < r < fc, (3) 

where 1 < fc < min(A//, A^), au = MN - 2k{M + N) + 
3k^ - fc, and Pk = M + N - 2k + 1. 

It is interesting to note that unlike AWGN channels, the 
error probability of fading channels (without transmit CSI) 
tends to zero with an exponent independent of the code 
blocklength as long as / > {M + A^). This is due to the 
fact that the error probability is dominated by the so-called 
outage error event: the event of having a sequence of atypically 
poor channel fading gains over the duration of a codeword. In 
this setting, in [1] it is shown that the optimal diversity gain 
can be achieved by using a fixed-length blockcode and when 
the decoding delay is equal to the length of the codeword 
blocklength I. 

C. Reliability, Rate, and Decoding Delay: Transmit CSI 

Transmit side information in its broadest sense includes both 
CSI and the received signal at the decoder (feedback channels). 
It is folklore that while feedback cannot improve the capacity 
of a discrete memoryless channel, it can improve the reliability 
function (see [6], [7] and references therein). The gain in the 
reliability is achieved by leveraging (i) the possibility of re- 
transmissions while keeping the codeword blocklength fixed, 
and (ii) using power control to boost the power when rare 
events happen [7]. This implies that the decoding time T 
is random and no longer equals the codeword length. For 
general DMCs, in [8], Burnashev obtained tight bounds on the 
smallest error probability of a code which transmits one of the 
messages in expected time at most I, that is, E(T) < / and 
unbounded maximum delay [8], [9]. In the asymptotic of large 
SNR, Gamal, Caire, and Damen proposed an ARQ scheme and 
proved that a single noiseless bit of feedback per codeword 
block pertaining to the status of the decoder can substantially 



improve the reliability even when a finite maximum decoding 
delay constraint is imposed, that is, with a finite total number 
of retransmissions. 

Inspired by these results on feedback channels, we explore 
the high-SNR asymptotics of the channel reliability function 
with one noiseless bit of causal CSI at the transmitter We 
show that with this limited transmit CSI and for a fixed code- 
word blocklength, the reliability function can be substantially 
improved even with a strict finite maximum decoding delay 
constraint. 

Here, the reliability function is defined in terms of the 
smallest achievable error probability for a blockcode of block- 
length I, size p' ', and maximum decoding time of ID (that is, 
T < ID). In particular, following [8], we define the diversity 
gain with CSI as 



di„bji(r, D) = liminf- 



logEn {Pe{R, pJ,M, N,D)} 



logp 



(4) 



where R = '^^^^ , T is decoding time, and the multiplexing 
gain r is defined as. 



r = iim 1 . 

p^oologp 



(5) 



In this paper we assume that the transmitter knows, through 
one bit receiver feedback, whether or not the channel realiza- 
tion satisfies a predecided criterion before transmission. The 
transmitter leverages this side information to postpone the 
transmission upto the time that the channel is favorable. When 
D is bounded and the transmitter has delayed the transmission 
for {D — 1) coherence blocks, the message will be sent in 
the next coherence block. In what follows, we investigate the 
behavior of rfi„bit(r, D) for different values of D. 

III. Diversity-Multiplexing Tradeoff: Bounded 
Expected Delay, Unbounded Maximum Delay 

The reliability of MIMO fading channels without transmit 
CSI is dominated by the probability of the outage event [1]. 
However, in the asymptotic of large SNR, the outage event is 
a rare event with a small probability of p~'^^^'> where d{r) is 
defined in Q. Therefore knowing whether or not the channel 
is in outage, messages can be scheduled for transmission only 
during favorable conditions. This would eliminate the atypical 
outage events and may also lead to unbounded decoding delay, 
though the expected delay can still be bounded. This is due 
to the fact there is a nonzero probability, however small, of 
arbitrarily long sequence of outage events. 

When there is no maximum decoding delay constraint, 
we assume that the receiver sends one bit to the transmitter 
at the beginning of each coherence interval indicating the 
occurrence/non-occurrence of the event: 

0^,00 : logdet{I + pH*H) < min(M, iV)(log /9-2 loglogp). 

A message is transmitted only when the channel fails to 
satisfy the condition Or, oo- The following theorem provides 
the reliability function defined in ^ with D infinite. 



Theorem 1 (Lower bound on diversity gain) Consider 
the channel of Section I//-AI with noiseless one-bit causal 
transmit CSI confirming or denying condition Or.oo- For any 
blocklength-l, I < L, there exists a block code with p^^ 
codewords for which 



di-Mt{r, oo) > l{min{M, N) - r), 



(6) 



where di-ut is as defined in 0. 

Remark: It is interesting to note that the error probability 
is now decreasing exponentially with the blocklength unlike 
the case where there is no transmit CSI. Comparing this error 
exponent with that of an AWGN channel with the same SNR, 
it is seen that the effect of channel fades can be completely 
removed. This substantial gain is obtained due to the relaxed 
(but practical) requirement on the decoding delay from being 
exactly equal to the codeword blocklength to being equal to the 
expected number of channel uses until codeword reception. It 
is straightforward to establish that the average decoding delay 
is equal toZ(l + e((i^)) 

Proof-sketch: By bounding the determinant of a matrix by 
its minimum eigenvalue and its trace, it readily follows that 
Pr(C'r.oo) = 9 ^ (log^p)^ ^ ■ This implies that the decoding 
delay T has a geometric distribution with parameter 1 — 
Pr(C'r.oo)- Therefore, the multiplexing gain as defined in ^ 
is equal to. 



lim 

p-*ca\ogp 



lim 

P^oo/log p/{\ 



rl log p 



Pr(a,oo)) 



(7) 



An achievable error probability can be found using the random 
coding bound. Following [2], we may write. 



Jc 



fH{H)dH 



< 



{det{I + pH*H)y 



where oo is the complement of the event Or. oo- H 

In the following section, we investigate the scenario in 
which the message decoding delay is bounded. 

IV. Diversity-Multiplexing-Delay Tradeoff: 
Bounded Maximum Delay 

In the previous section, we observed that a substantial gain 
can be obtained in the diversity gain with little CSI at the 
transmitter at the cost of having the possibility of infinite 
decoding delays (albeit with arbitrarily small probability for 
large SNR). In this section, we further impose a worst case 
delay constraint of D for message decoding. In other words, 
we assume that the decoder has to decide on a message at most 
in D coherence intervals of the channel. Clearly, this would 
imply that there is a non-zero probability for a sequence of 
really bad channel events. 

When D = 1, the problem reduces to the problem consid- 
ered in [1] with the only difference that the transmitter has 
1-bit causal CSI. Since there is no possibility for postponing 
the transmission and power allocation optimization for one 



coherence interval does not change the diversity gain, it is 
straightforward to show that di^huir, 1) = d{r]^ . 

In this section, we prove that with only one bit causal CSI 
feedback at the transmitter, one can substantially improve the 
diversity gain of codes of length / by leveraging the possibility 
of postponing the transmission and/or exploiting the long-term 
average power constraint. This substantial gain is achieved by 
only transmitting when the channel is in a favorable condition 
or we have reached the maximum transmission delay. Since 
the event of having a sequence of unfavorable channels is very 
unlikely, one can, without violating a long-term average (over 
messages) power constraint, boost the transmit power when 
faced with greater delays. We also explore the potential gains 
for a (short-term) peak to average power constraint. 

A. Short-Term Average Power Constraint 

Here we assume that the receiver sends one bit feedback to 
the transmitter confirming or denying the following condition: 



Or^D ■■ logdet(/ + pH*H) < f{r, D) log p. 
where /(r, D) is defined as, 

d{r) + {D- l)Meq{mm{M, N) - r) 

l+iD- l)Meg ' 



f{r,D) 



(8) 



(9) 



where I is the code blocklength, Al^q = |A/ — + 1, and d{r) 
is as defined in (O. Of course, this is the outage event and 
has a probability of p~'^(f(^'^)) , In our transmission scheme, 
the transmitter postpones the transmission until the channel 
belongs to the set of channels that do not satisfy the condition 
Or,D or reaches the maximum delay constraint of D. In 
this transmission scheme, transmission always occurs with a 
transmit power p. 

Theorem 2 For the channel of Section I//-A1 the diversity 
gain of a code of blocklength I and cardinality p^\ with short- 
term power constraint p, is at least equal to 

di-Mt{r,D) > l{f{r,D)-r) = 
-^—^J—^y{r) + {D- l)A4,(min(A/, N) - r)) 

for any [M + N) < I < L, where M^q = \M - iV| + 1, 
< r < min(il/, N), and d(r) is as defined in Q- 

Proof-sketch: We use the random coding bound to obtain 
an upper bound on the error probability. We condition the 
error probability into two events, namely, i) Ai: the channel 
realization satisfies jj in at least one out of the first (D — 1) 
coherence intervals, ii) A^: the channel satisfies Or.o during 
all {D — 1) coherence intervals. The error probability in the 
event of Ai can be easily bounded using the same approach 
as in Theorem 1 as. 



l{r-f(r,D)) 



(10) 



^The reason being that in the regime of large SNR, the exponent of the 
scaling law of the outage probability with SNR would not be changed by 
optimal power allocation over one coherence interval. 




Fig. 1. Diversity -multiplexing tradeoffs with one-bit noiseless causal 
transmit CSI for different values of D for M = N = 2 and I = 8. 



Also, note that the probability of the event A^ is the event 
that condition Or,D (cf. (HJ) is met {D — 1) times. Therefore, 



Pr{Al) < p 



-{D-l)d{f(rM}) 



Furthermore, conditioned on the event A^, the error probabil- 
ity is given by the error probability for D ~ 1. Therefore, 

E(Pe) = E{P,,A,)+EiPe,Al) 

< pKr~fir.D)) p-(D-l)d{f{rM))^dir)^ ^^^^ 

In order to calculate the largest achievable diversity gain, one 
needs to minimize the upper bound on the error probability in 
( fTTT ). that is, match the exponents by finding f{r,D) which 
satisfies. 



lir - f{r, D)) = -{D - l)d(/(r, D)) - d{r). 



(12) 



The expression in ^ solves (fTSl i with d{f{r,D)) replaced 
by the smaller value Meg(min(A/, N) — f{r, D)). This corre- 
sponds to minimizing a larger upper bound (than in ( fTTT i) on 
the error probability. Replacing f{r, D) in (fTTT i completes the 
proof. ■ 

Specializing Theorem 2 to the case = 1 gives, 
1{MD - 1) 



di-bit{r,D) > 



(13) 



l + M(D-l) 

for any < r < 1. This shows that when AID ^ I, 
di-bit > l{l - r) and when I > A/L», di^m > [J^ID - 
1)(1 — r). Figure [U shows the diversity multiplexing tradeoff 
for increasing values of D. It is clear that with little side 
information and a small delay tolerance, the reliability function 
can be significantly improved. In the next section, we show 
that the gains with bounded delay can be further increased by 
exploiting a long-term power constraint, that is, by boosting 
the power when rare events happen. 

B. Long-Term Average Power Constraint 

In the regime of high SNR, the outage event has a very 
small probability and therefore the transmitter can boost the 
power in proportion to the corresponding probability when the 



transmitter is subject to a long-term power constraint. In this 
section, we show that we can further improve the reliability 
function using power control. 

Side information here is assumed to be one bit causal CSI 
that informs the transmitter whether the following condition 
is true or false: 



logdet(/+p9'('-'^'i7*i7) < Mr,D)\ogp. (14) 



where i denotes the number of consecutive channels that sat- 
isfies the conditions Or,D,i, ■ ■ ■ , Or,D,i-i, respectively. Here 
fi{r, D) and gi{r,D) are defined as. 



D 



Oik 



M 



(15) 



where gi{r,D) = 1. Here gi{r,D) is the transmit power 
when the previous i — 1 channels satisfy all the conditions of 
Or^D,i, ■ ■ ■ , Or,D,i-i- The ncxt theorem provides an achiev- 
able upper bound on the error probability that implies a lower 
bound for the diversity gain. 

Theorem 3 For the channel of Section l//-Aj the diversity 
gain of a code of blocklength I and cardinality p'"', with long- 
term average power constraint p, is at least equal to 

di-Mtir,D) > , , J {mm{M,N)M,gDeq 



I + DeqMeq 



+ak-{Deq+ (3k)r). 



where for M ^ N, 



M 



1, 



(16) 



D{D+1) 



and for M — N, we have D^q 

Proof-sketch: The proof follows again using the random 
coding upper bound on the error probability. The only ad- 
vantage here is that the probability of not transmitting until 
the D'th coherence interval is much smaller as we can use 
power control. This can significantly improve the achievable 
diversity gain. ■ 

In the special case where = 1, Theorem 3 implies that 
the achievable diversity gain is no smaller than 



di-ut{r,D) > 



M 
M- 



D 



1 + 



M 

M-1 



D 



M 



-il-r). 



(17) 

Figure [T] compares the achievable diversity gains with and 
without power control as obtained in (fTTT i and (fTsT l, respec- 
tively. It is clear that with only D = 4, we can obtain most 
of the gains achieved via infinite worst case delay obtained 
in We can quantify this observation by computing the 
delay D required to achieve within e of the lower bound for 




Fig. 2. For a system with M = 3, N = 1, and I = 10, the figure 
with respect to D for any r and for the two cases 



shows -j V 

with long-term/short-term power constraints. 



di-bit{r, oo), that is, 1(1 — e)(l — r). It is straightforward to 
show that the delay required is equal to 

1) D — with no power control, 

2) D = logjj(^) with power control. 

Therefore with power control, we need an exponentially 
smaller delay to achieve the same performance as opposed 
to the case where no power control is employed. 
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